A novel feature extraction using multiple acoustic feature planes for HMM-based speech recognition
نویسندگان
چکیده
This paper describes an attempt to extract multiple peripheral features of a point x(ti,fj) on a timespectrum (TS) pattern by observing n×n neighborhoods of the point, and to incorporate these peripheral features (MPFPs: multiple peripheral feature planes) into the feature extractor of a speech recognition system together with MFCC parameters. Two types of peripheral feature extractor, MPFP-KL and MPFP-LR, are proposed. MPFP-KL adopts the orthogonal bases extracted directly from speech data by using KLT of 7×7 blocks on TS patterns. In MPFP-LR, the upper two primal bases are selected and simplified in the form of ∆t-operator and ∆foperator obtained by linear regression calculation. MPFP-KL and MPFP-LR show significant improvements in comparison with the standard MFCC feature extractor in experiments with the HMM-based ASR system.
منابع مشابه
Feature extraction for speech recognition based on orthogonal acoustic-feature planes and LDA
This paper describes an attempt to extract multiple topological structures. hidden in time-spectrum (TS) patterns, by using multiple mappmg operators, and to incorporate the operators into the feature extractor of a speech recognition system. In the previous work, the author proposed a novel feature-extraction method based on MAFP/KLT (MAFP: multiple acoustic-feature planes), in which 3x3 deriv...
متن کاملA novel feature-extraction for speech recognition based on multiple acoustic-feature planes
This paper describes an attempt to incorporate the tinctions of the auditory nerve system into the feature extractor of speech recognition. The tinctions include four types of well-known responses to sound stimuli: local peaks of spectrum in steady sound. ascending FM sound, descending FM sound, and sharply rising and falling sound. Each function is realized in the form of a three-level derivat...
متن کاملCanonicalization of feature parameters for automatic speech recognition
Acoustic models (AMs) of an HMM-based classifier include various types of hidden variables such as gender type, speaking rate, and acoustic environment. If there exists a canonicalization process that reduces the influence of the hidden variables from the AMs, a robust automatic speech recognition (ASR) system can be realized. In this paper, we describe the configuration of a canonicalization p...
متن کاملشبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملSpeaker Dependent Speaker Recognition Using Svm and Hmm
Speaker recognition is the process of recognizing the speaker based on characteristics such as pitch, tone in the speech wave.Background noise influences the overall efficiency of speaker recognition system and is still considered as one of the most challenging issue in Speaker Recognition System (SRS). Support Vector Machine (SVM) and Hidden Markov Model (HMM) are widely used techniques for sp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000